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Basing decisions and estimates on simultaneous approximate 
measurements of noncommuting observables in a quantum receiver 
is shown to be equivalent to measuring commuting projection op- 
erators on a larger Hilbert space than that of the receiver it- 
self. The quantum-mechanical Cramer-Rao inequalities derived 
from right logarithmic derivatives and symmetrized logarithmic 
derivatives of the density operator are compared, and it is 
shown that the latter give superior lower bounds on the error 
variances of individual unbiased estimates of arrival time and 
carrier frequency of a coherent signal. For a suitably weighted 
sum of the error variances of simultaneous estimates of these, 
the former yield the superior lower bound under some conditions. 
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1. Quantum Measurement 


Quantum detection theory has been developed within the conventional frame- 
work of quantum mechanics, one of the principal tenets of which is that only 
observables associated with commuting operators can be simultaneously measured 

[ 1 - 3 ] 

on the same system. It has been suggested that this formulation is too. 

restrictive, that noncommuting operators can be at least approximately measured 

on the same system, and that to include this possibility may permit more effec- 

[4 5 ] 

tive detection, appraised at a lower average Bayes cost. ’ An explication 
of this proposal requires prior discussion of quantum measurement in general. 

r ft— 8 1 

A more thorough treatment can be found in many a textbook on quantum mechanics. 

In quantum mechanics an observable is associated with an Hermitian op- 
erator, say F, which possesses an array of orthonormal eigenstates |f n ) and eigen- 
values f , defined by 
n 


F|f ) = f If > . (1) 

n n ' n 

The operator U n )^ n l projects an arbitrary state vector onto the eigenvector 
|f n ), and these projection operators |f n )(f | commute for different indices n 
and sum to the identity operator. 


n 

we say that they form a complete, commuting resolution of the identity. 

The primary measurement concept of quantum mechanics envisions the mea- 
surement of an observable F as performed by an ideal apparatus that applies to 

the system such a resolution of the identity. The state of the system, origin- 

[91 

ally — let us say — described by a density operator p, is projected onto one 
and only one eigenstate If^), and the apparatus registers the associated eigen- 
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value f which is termed the outcome of the measurement of F; the probability 

n’ 

of this event is 

Pr (f n ) = Tr [P|f n ><f n l] = < f n l p l f n > ’ (3) 

where 'Tr' stands for the trace of an operator. Because Tr p = 1, (2) shows 
that these probabilities sum to 1. (An operator may be "degenerate*", possessing 
a set of several eigenstates for each eigenvalue f . A practical measurement 
of F may then project the initial state onto the subspaee spanned by that set.) 

More generally, any finite or infinite resolution of the identity into 
commuting projection operators E , 

} ^ = ^n^m = ^n ‘■’nm > 

n 

is in principle measurable by some apparatus. A projection operator has eigen- 
values 0 and 1, and the apparatus measuring the set {E n > will register the value 
1 for one of them, say E^, and the value 0 for the rest; the probability that 
this happens is Tr(pE^). The fundamental role of projection operators in quantum 
measurement was emphasized by von Neumann. 

Commuting operators share a common resolution of the identity into com- 
muting projection operators, and to say that commuting operators are being mea- 
sured simultaneously on a system is only to assert that the apparatus is apply- 
ing to it their common resolution of the identity. The eigenstates composing 
the resolution bear multiple parameters, each of which is associated with one 
of the commuting operators . The simultaneous eigenstates |xyz) of the 
operators X, Y, and Z corresponding to the three rectilinear coordinates provide 
one example; another is the set of simultaneous eigenstates of the energy H, 
the total angular momentum L, and the component of angular momentum along 
an arbitrary axis, for a particle in a spherically symmetrical potential. 

The observables X, Y, and Z — or H, L, and — are said to be simultaneously 
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measurable or compatible. The simultaneous measurement of noncommuting obser- 
vables has no meaning in the sense of this primary concept of measurement. 

Gordon and Louisell have advanced a secondary concept of measurement that 

may lend meaning to the at least approximate simultaneous measurement of non- 

[ 12 ] 

commuting observables. They remind us that there exist overcomplete resolu- 

tions of the identity in terms of noncommuting projection operators’. The most 
familiar example involves the coherent states |a), which are the right eigen- 
states of the annihilation operator a that plays an important role in the quantum 
theories of harmonic oscillators and boson fields [13], [8, §3.7], 

a | a ) = a | a } , 

a = a + ia . (5) 

x y 

The real part a^ of a is proportional to the coordinate operator, the imaginary 
part a^ to the momentum operator, and these do not commute, nor does the non- 
Hermitian operator a commute with its conjugate a + , the creation operator, 

aa+ - a+a = [a, a + ] = 1. (6) 

The eigenvalues a = + i a y are complex, and the states |a) are overcomplete 

in the sense that 


T -1 


|a)( a d 2 a = 1, d 2 a = da x da. 


(7) 


As these states are not orthogonal, 

<a|e) = exp (a*£ - y|a| 2 - (8) 

the projection operators |a)<oi| do not commute. 

Because of (7) it is tempting to assert that the quantity P(a ,a v ) = 

x y 

7T — 1 (ot|p|a) represents the joint probability density function (p.d.f.) of the 
outcomes a x and ay of simultaneous approximate measurements of the noncommuting 
operators a x and a^ when the system has the density operator p, for 
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Generalizing to arbitrary multiparameter noncommuting resolutions of the iden- 
tity, Gordon and Louisell defined an 'ideal* measurement as one yielding out- 
comes having a joint p.d.f. calculated in this way, and the nonorthogonal states 
corresponding to the coherent states |a)were termed 'measurement states'. They 
showed how such an ideal measurement can be carried out on a system S by causing 
it to interact for a time with a second system A, termed the apparatus. After 
the interaction, commuting observables are measured on the apparatus A, or more 
generally on both S and A; and from the outcomes of these measurements, values 
of a x and oiy are deduced that have a joint p.d.f. given by P(a x , ay) as required. 

2. Detection 


Let us apply to quantum detection the scheme of allowing the system S, 
which is now our ideal receiver, to interact with a suitable apparatus A. The 
decision about the received signal will be based on the outcome of measuring 
commuting projection operators on the combined system S+A. We suppose that an 


observer is to decide among M hypotheses about the state of the 

receiver. It interacts with the apparatus A for a while, and at a later time 


t the density operator for the combined system under hypothesis H_. is p (t) . 
Let {IL) be a set of commuting projection operators forming an M-fold resolution 
of the identity, 



( 10 ) 


On the combination S + A we are to measure these M projection operators at time 
t, and if the k-th yields the value 1, hypothesis H k is selected as true. The 
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average cost is then [1] 


C 


M M 

EE'i 


i=i j=i 


C ±i Tr[p? +A (t)n.] , 


( 11 ) 


where is the prior probability of hypothesis H^ and CL_. is the cost of choos- 
ing H-j_ when Hj is true. Let {Hj (t) } be the projection operators that minimize C 
when the system S + A is observed at time t; we call these optimum. 

In the Schrodinger picture the density operator p? +A (t) is related to the 

S+A , . .. „ , [14] 

density operator Pj (tp) at an earlier time tp by 


p? +A (t) = U(t,t 0 ) Pj +A (t 0 ) U + (t,t 0 ), 
with U(t,t 0 ) a unitary operator obeying the Schrodinger equation 


( 12 ) 


ih U (t , t 0 ) = HU ( t , t o ) , 


U(t 0 ,t 0 ) = 1, UU+ = 1, (13) 
where H is the Hamiltonian (energy) operator for the combined system S + A and 
h is Planck's constant h/2iT. 

As the set (llj[(t) } of projection operators minimizes the Bayes cost C 
when measured at time t, by (12) the operators 


Hj (t o) = U + (t,t 0 ) Hj(t) U (t , 1 0 ) (14) 

minimize C when S + A is observed at time tp. Because of the unitarity of 
U(t,tp), the set (^(tp)} also forms an M-fold resolution of the identity into 
commuting projection operators, and the Jlj(tp) are optimum at time t 0 . Since 
the minimization is carried out over all possible M-fold resolutions of the 
identity, the minimum Bayes cost C m ^ n must be independent of the observation 
time t. To this independence Helstrom^^ and Liu^^] have previously adverted. 

Now let us roll time back to an epoch tp before the system S has come 
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into contact with the apparatus A. In the Schrodinger picture this amounts to 
applying the inverse unitary transformation U"*"(t,to) to the state vectors of 
the combined system S + A. Because S and A are independent at this time to, 
the density operators p S+A mus t now have the factored form 

p S+A (t 0 ) = p. S (t 0 ) 0p A (t o ), j = 1,2 M, (15) 

3 

on the product of the Hilbert spaces for system S and apparatus A. Furthermore, 
as the apparatus A before the interaction has no information about which hy- 
pothesis is true, p A (tg) in (15) must be independent of j. The Bayes cost is 
now 


C = Tr 



>V 0 >® Pj S c t 0 > 


(16) 


Let us define positive definite Hermitian operators acting on the 
Hilbert space g of the system S alone by 


Q ± - Tr A [ P A (t 0 ) n i (t 0 )] , 


(17) 


where Tr indicates a trace with, respect to the Hilbert spaced of the apparatus 
A A 

A. Then the Bayes cost can be written 


C = Tr, 


M M 

V' V' 


Lj La c 3 c ij p i Q i ; 


(18) 


i=l j=l 


we consider henceforth only the epoch t 0 and dispense with labeling our operators 
therewith. The operators form a resolution of the identity l g for the system 



(19) 


i=l 


but they do not necessarily commute and thus may not be measurable on system 
S alone. 
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Although it is not immediately obvious, any set of suitably 

well-behaved positive-definite Hermitian operators that satisfy (19) 

can be expressed in the form of (17) t 17 » 18 j| . Spec ifi ca 2 iy t ± t i s possible 

to find a Hilbert space ^ ^ and a density operator such that in the 

product space ® C/C there exists a set { H . 1 of commuting projectors that 

when substituted into (17) yield the given Q^. There is considerable arbitariness 

A 

in the choice of p ; the spaced, must in general have infinite dimension. 

A 

A 

The density operator p may be taken as representing a pure state of the 
apparatus , 


P - I VX'f.l. 


( 20 ) 


Let J w^) be a complete orthonormal set spanning the Hilbert space The 

vectors ) 1 4^ ) then span a linear manifold in jjg 0 within which the 
operators 0 p A form a noncommuting resolution of the identity. By virtue 
of a theorem of M. A. Nalmark's^^ , this resolution can be extended to a 
resolution of the identity into commuting projection operators satisfying 
(10) and (17). 

The equivalence of the constraints provided by (17) and (19) allows 
us to view the cost minimization problem as one of minimizing (18) over all 
sets of positive-definite Hermitian operators that satisfy (19). A special 
form of this problem was first posed and solved by Yuen but its 

equivalence to the original minimization problem was not then established.- 

If the Q i that minimize the Bayes cost C in (18) do happen to commute, they 


must possess at least one common set of eigenstates |v^ ) and can be written 
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( 21 ) 


Q i ' ’ 

j h 

0 s «ij XKij ‘ 1 • 


i=l 


Then the optimum strategy is to measure the projection operators |v.)(v. | in^ 

3 J S 

and to choose hypothesis with probability q^ k when the measurement of 
|v k )<v k | yields the outcome 1. This would in general be a randomized strategy. 
However, once the set { | ) (v^ | } has been specified, the decision process can 
be treated by classical decision theory in terms of the probabilities Pr{v k |H_.} = 
< v k |p 3 S |v k > and the likelihood ratios formed from them. As we know that the 
minimum Bayes cost can be attained by a pure strategy, the q^ k 's must be either 
0 or 1, and the Q-^’s must be projection operators. If we then take IK = 1 A 
as the decision operators in the product space of S + A, we can attain the same 
minimum Bayes cost C . . Furthermore, C . will be independent of the density 
operator of the apparatus, and measurements on both system and apparatus 
cannot lead to a lower Bayes cost than measurements made on the system S alone. 

There are three cases in which the optimum Bayes strategy is known to 
require measuring commuting projection operators on the space of the receiver 
alone: (a) binary decisions (M = 2)^^, (b) decisions among M commuting density 

operators^^, and (c) decisions among M linearly independent pure states^^. In 
general, however * there is no guarantee that the positive-definite Hermitian 
operators Q i that minimize the Bayes cost C of (18) subject to (19) xvill commute. 

To the contrary, A.S. Holevo^^ has shown, by example, that minimum Bayes cost 
may sometimes be attained by a set of positive-definite noncommuting operators 
in the Hilbert space He also showed that, in general, the minimizing operators 
obey the conditions 
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( 22 ) 


(K. - A) Q. = 0 K. 

i i ,1 


- A > 0, i =1,2, M, 


where 


and 


M 

K. -V ;.c..p s 

1 J 1J J 

j=l 

M 

K i q i 

i=l 


(23) 


(24) 


For t, . = M C . , = 1, i t j , C.. = 0, these reduce to the conditions derived 
J ij 

by H. Yuen . If the minimizing operators Q_^ turn out to commute, they 

can be measured on the receiver itself. Otherwise an extension to a set of 
commuting projection operators II. in a product space .'/f ® .y€. must be determined, 
and if the minimum Bayes cost is to be actually attained, some way of measuring 
them must be invented. 
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3. Estimation 


Determining the values of signal parameters such as amplitude, carrier 
frequency, and time of arrival corresponds in quantum theory to estimating 

S 

the parameters 0 .., 0 ., ..., 0 of the density operator p (0 , 0 ^, 0 )■ 

of a receiver of the signal. Parameter estimation is' a continuous version 
of multiple hypothesis testing; costs of errors as functions of the true 
values and the estimates lead to an average cost similar to C in ( 11) ; and 
by an argument like that in § 2 the minimum cost can be shown to be independent 
of the epoch at which the receiver is measured. One can again expect that 
attaining the minimum cost may require measurements of commuting operators 
in a product Hilbert space ® of the receiver S and an auxiliary apparatus 

o A 

A. Holevo^^has treated both estimation and detection from this point of 

view. In general, the operators that must be measured in order to minimize 

an arbitrary cost function will be difficult to calculate. The minimization 

of a quadratic cost function in estimating a single parameter 0 has been 

r 211 

achieved by Personick , and Yuen and Lax have similarly derived the 

optimum estimates of a pair of parameters combined into a complex number, the 

[ 22 ] 

cost being the sum of the mean-square errors in each; 

Bounds can be set below the mean-square . errors of unbiased estimates of 

parameters of an ordinary probability density function by means of the Cramer-Rao 

inequality of statistics. A quantum-mechanical counterpart to it has been 

formulated in terms of the symmetrized logarithmic derivatives of the density 
S [ 3 ] 

operator p . Yuen and Lax have produced a similar inequality, but have 
restricted it to the simultaneous estimates of pairs of parameters combined 

[ 22 ] 

into complex numbers, one pair for each mode of a multimode quantum receiver. 

We x^ish to show how their method can be applied to setting bounds below the 
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variances of unbiased estimates of arbitrary real parameters of a density 
operator. The role of noncommuting observables measured on the receiver itself 
will become apparent, and we shall see that sometimes the one, sometimes the 
other type of bound is superior. 

A quantum-mechanical system S is in a state described by a density operator 


p b t0) that is a function of a number m of unknown real parameters = ^ 0 l’®2’'' 

0 ), whose values are to be estimated. The procedure for doing so may require 

creating an auxiliary system A, which will be called the apparatus and which 

A 

is in a state described by the density operator p independent of _0_; the 
density operator for the combination of system and apparatus is the tensor 
product 

r> a 

(25) 


p (ej = p S (e) @ P A . 


On this combination a set of commuting Hermitian operators 0,, 0„ 0 are 

12 • m 

A* A a 

measured, and the outcomes 0,,0„,...,8 of the measurement are taken as the 

12m 

estimates of the parameters 0. These estimates are simultaneously eigenvalues 

A 

of the m operators 0. with eigenstates |0) defined in the product space of 


S + A, 

AAA A A A 

0j | 0) = 0^ | 0) = (®2 ,0 2 ,, ** , 0 m^ ’ d — -l- > 2 , . .m. 

We postulate that the estimates are unbiased, 

Tr( p 0.) = 0 4 s j = 1,2,..., m. 


(26) 


(27) 


J "3 

An inequality of the Cramer-Rao type is now derived by a method used by 

[ 22 ] 


Yuen and Lax. 
defined by 


The right- logarithmic-derivative (r.l.d.) operators are 


9p T T + 

30 = pL j = L j P ’ 

Equation (27) and the fact that Tr p = 1 give as usual 


Tr 


- 12 - (0. - 0.) = Tr p(0.-0.)L. + = 6. . 
j J M J 3 k kj 


30, 


( O.k^.j 


(28) 


(29) 
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( 30 ) 


Multiplying by the complex numbers y . and z and summing we get 


3=1 k=l k=l 

which can be written 
Y t Z = Tr PQ + , 

where Y and Z are column vectors of the v 's and z ’ s respectively, and P and 

'j 3 


Q are the operators 

1/2 * 

p - p E cg j - e J > * 


(31) 


3=1 


1/2 

Q = P 


m 


V z. L.. 

/ . 3 3 

3=1 

The Schwarz inequality for traces, 

| Tr PQ + | 2 £ Tr PP + Tr QQ + , 

yields 

| Y + Z | 2 £ (Z + AZ) (Y + BY), 

where A and B are Hermitian matrices whose elements are 

A. . = Tr P L.L. + , 

13 13 

B, . = Tr P (0. - 0 .) (0. - 0 .) 

13 1 x j j 

' /■» A 

= cov (0 ± ,0 ). 


(32) 


(33) 


(34) 


(35) 


(36) 


The diagonal element B.. is the variance Var0. of the estimate 0.; cov 
0 11 x x 

stands for covariance. Equality holds in (.33) if P and Q are proportional, 
that is if 

m -m 

p Z - V - W’-E L r <37) 

3=1 


3=1 


1/2 


where we have multiplied both sides by p for later use. 
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Here k(0) is a c-number function, not an operator. 

If we put Y = B ■*" Z, we obtain from (34) 

Z + B _1 Z < Z + A Z (38) 

— ~ r* ~ ~ ~ 

the left-hand side of which, when the z 's are real variables, is a quad- 

j 

ratic form that when set equal to a constant ^m + 2J specifies the concen- 

[231 

tration ellipsoid of the estimates 1 ; and (38) states that that concen- 
tration ellipsoid lies outside the ellipsoid specified by the equation 
Z + A Z = m + 2. Alternatively we can put Z = A ^ Y to obtain the mul- 
tivariate Cramer-Rao inequality 


-f* 4* — 1 

Y B Y >_ Y A Y, 


(39) 


which for various choices of the y.'s yields inequalities among linear 
combinations of the variances and covariances of the estimates. Further- 
more, a quadratic risk function can be set up in terms of a positive— de- 
finite matrix G as 

R = Tr G B, (40) 

and by writing G in terms of its eigenvalues and eigenvectors one can show 
that the risk R must exceed Tr G A ^ 

A 

Because the density operator p has the product form in (25) with p 
independent of 6, the r. l.d. operators L. must have the form 


L . = L , 0 1. , 

T "A 7 


(41) 


where the r.l.d. operators defined by 

r, S / S S 

dp / 90j = p 


(42) 


act only on the Hilbert space of the system S, and 1 is the identity operator 
for the Hilbert space of the apparatus A. Thus the matrix elements A de- 


fined by (35) become 

" *s‘p\V> 


M3) 


14 



and are independent of the density operator p for the apparatus. These 

bounds, therefore, have a universal validity in the sense that they hold 

no matter what auxiliary apparatus A is brought up and no matter what state 
A 

p it is in. They apply also to measurements of commuting estimators on the 
system S alone. 

If we put into (37) Z = A ^Y, we find that equality in (39) is attained 

if there .is a set of commuting estimating operators 0. such that 
m m m J 

°E y / (e 3 • v ■ kc?) p Y Y y / <6 ' 1) ^ L k 


j=i 


3=1 k=l 


or 


m 


P S © p A V' y.*(0. - e.) 

L-J 3 3 3 


E E ’jVVfrV) 


® p A . 


(44) 


j=l 

m m 

= k(0) ’ 

3=1 k=l 

Now, following Holevi^^e suppose that there exists a mapping of the parameter 
space 0 into a set of non-negative Hermitian measurement operators X(F) in 
the Hilbert space <3^ of the system S such that F -*■ X(F)., where F is any 


region of the space 0, 0 -»■ lg, and 


E 


X(Fi> - l s . 


(45) 


when {F.} is a countable measurable decomposition of the space 0 into disjoint 


regions F_^. Let X(0 ; d m 0) be the operator corresponding to the differential 


,m„ 


region of volume d 0 in the neighborhood of the point 0 of the parameter space, 
and define the operators 


0 S =( 0 X(0 ; d m 0), 0 = (0,0 ) 

J J J * -L m 


0 


(46) 
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s 

The operators 0. may not commute, but suppose they satisfy the equation 


m 

.m m 

■^E E y / ( ~~Ev s - 

(47) 

3=1 

3=1 k=l 



Then one can interpret the results of Holevo's work as asserting the existence 

of an apparatus A, a product spaced ® , a pure state p^=|T ) A | of 

b A A A 

the apparatus, and a set of commuting estimators {0.} in C/C ®<yC A such that 

3 b A 

(44) holds and the equality in (39) is attained. This set of estimators is 

A 

constructed by starting with the set of operators X(F) © p acting in the 
subspace © | ¥ ) of C/C c ®C/C .. that is spanned by the tensor products 
| w..) 1 4^) of and each of a complete orthonormal set of vectors |w^) spanning 

C/C . By using Naimark's procedure this noncommuting set of operators can 
be extended to a commuting resolution of the identity in c/£ o ® C/C > 

u A. j 

X(F) ® p A x' (F), 


where 


E*'<v^ 


S+A 


(48) 


for any countable measurable decomposition of the parameter space 0 into dis- 
joint regions F^. Projection of the operators X' (F) into the subspace 

A 

C/C © Jy ) yields the operators X(F) ®p . Then the estimators defined by 


9 j = / 6 j X ' ’ d 
0 

will commute and satisfy (44), for 

(•p S 0 p A ) , e j =p S J xcejd^e 


(49) 


; d 9 ) ® p‘ 


0 

S ~ S A 

= p b 0 b © p , 


(50) 
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which when applied to (47) yields 044). In order to carry out this extension 

a product space x ^ of infinite dimension is required. As Holevo pointed 

A 

out, the state p and the apparatus are not necessarily unique; many such states 

in many such product spaces /yjf ©.^ nay exist that permit achieving equality 

b A 

in (39). 
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4. Estimation of Complex Mode Amplitudes 

As an example we use the well-worn problem of estimating the real and 

imaginary parts of the complex amplitude p = p + ip of a coherent signal 'in 

x y 

a harmonic oscillator representing a single mode of a quantum receiver with 

[13] 


thermal noise. The density operator is, in Glauber's P-representation 
P S (p) = (ttN) 1 J' exp ^ -|a-p| 2 /N j | a ) < a | d 2 a , 


(51) 


where la) is a coherent state, a = a + ia , d a = da da , integration is 

x y x y 

taken over the entire complex a-plane, and N is the mean number of thermal 

2 

photons in the mode. The normalization is such that |p| is the mean number 
of photons contributed by the signal. 

By (2o) and eq. (3.20) of [1] the r.l.d. operators corresponding to 

S S 

3p /3p and 3p /3p are 

x y 


N 


+ 


N+l * 


_ . / a' + a' \ 

1 \ N N+l / ’ 


(52) 


where a' = a_-p and a„ is the annihilation operator for the mode, obeying 

O D 

the usual commutation rule a g a g + ~ a s +a s = ~g' Using this and Tr p a^ d = N, 
we can easily derive the matrix A of (35) and its inverse. 


and 


A = 


K -1 


1 

2N+1 

-i 

A _1 - 1 

’ 2 N+l 

i 

N (N+l) 

i 

2N+1 

’ - "4 

-i 

2N+1 


[L S -| 





S x 

s 



L 


P c - P 

y 


L s y 


(53) 


(54) 
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where 


2 ( V a S ^ ’ P S 2 1 ^ a S “ a S^ 


(55) 


are proportional to the coordinate and momentum operators for the mode. If 
we now take y [1 OJ and y" P= [0 1J we obtain from the inequality .(39) 
the bound! 


Var u and Var y > (N •+■!■) . 
x y — 2 2 


(56) 


By (47) the operator achieving the minimum value of the variance Var y^is Qg> 
and the operator minimizing Var y^ is Pg. These do not commute and cannot 
be measured simultaneously on the same system S in such a way that both attain 


the minimum variance y(N + —) . 


If on the other hand we take y = [ 1 -ij, we find from (39) for simultaneous 


commuting 

measurement of the . estimators M of y and M of y 

a x x y y 


Var y + Var y = 
x y 


Tr 


p[( M x" y x) 2 + ( M y~ y y) 2 ]- N + 1 ’ 


(57) 


[ 22 ]. 


an inequality discovered by Yuen and Lax. ‘•"^-'if we use (54) with y + -jj 

in (47) we find that inequality is achieved in (57) by the noncommuting opera- 
tors M^' = Qg, M ' = Pg. The noncommuting resolution of the identity correspon- 
ding to (45) is provided by the operators 


X 


( y ; d y ~ I n X y) d 2 y/ir> 


(58) 


in terms of the coherent states |y>, and the real and imaginary parts of 

V 1 P S = a S = f v 1^ > < y | d 2 y /u (59) 

correspond to (46) . 

It is known that equality in (57) is achieved by measuring the commuting 
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[ 24 ] 

operators 

M x “ Q S + V M y - P S - P A <60) 

on a combination of the system S and another harmonic oscillator A in the 


ground state |0 ), the density operator being now 

A 

P = P S © l° A > < ° A I • (61) 

Here Q and P are coordinate and momentum operators for A. Equation (44) requires, 

A A 

with k(y x ,vO = 1, y + = [1 -i] , 

( P S ® P A ) + iM y j = P S a g ® P A 

which is satisfied by (60) because 

pA («A - 1 P A ) " »' ' \ ' I °A><°aK + ' °' 
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5. Estimation of Essential Signal Parameters 

By essential signal parameters we mean those intrinsic to the form of the 

signal, as distinct from its complex amplitude ; the arrival time t and carrier 

frequency ft are the essential parameters we shall deal with here. We shall see 

that when these are measured separately, the Cramef-Rao inequality based on the 

[ 3 ] 

symmetrized logarithmic derivatives (s.l.d.).2? yields ^greater lower bounds on 

the error variances than does the inequality (34) based on the right logarithmic 

derivatives (t.l.d.) L.. 

3 

The s.l.d.j^. is an Hermitian operator defined by 




(62) 


.[3] 


The corresponding Cramer-Rao inequality has the form 
Y B' Y >_ Y A' -1 Y, 

where the column. vector Y has real- elements and Y = (y n y„. 
of the matrices A' and B' are 


• y m> 


A'ij = 2 Tr + ^j^i), 


(63) 

The elements 

(64) 


B' = j Tr p (©' Q f + O' 0{) ; 0 ' = 0 . _ e . (65) 

If one goes through the derivation of (63) as in reference [3], one sees that 

it does not require that the estimating operators 0^ commute. Furthermore, the 

S A 

density operator p may have the product form p (B) © p corresponding to a 
combination of the receiver S and an. ignorant apparatus A. The s.l;d. operators 
will then take the form 


=3'i S © 1 A (66) 

s s 

where the s.l.d. operators 5?. are given by (62) with p replaced by p (6)* The 

g 

matrix A' then depends only on p , and the bound in (63) has a universal validity f 
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applying no matter what apparatus _A is used nor what state p it has. 

We consider estimates of parameters 0 of a coherent signal with complex 
envelope 


g(6, t) = 6 f Ct-T)e ifi(t T) , - 


8 = + i$ , 0 = C& x , B , t, fl), (67) 

t being the arrival time and fi the deviation from a reference carrier frequency 

£2 q . The signal field is spatially coherent over the aperture of the observing 

[25] 

instrument, so that (67) represents the amplitude of a single spatial mode. 

The signal occupies such a narrow band of frequencies that the thermal noise 

can be considered to have a spectral density independent of frequency, and the 

observation interval (- y T, -|t) is assumed so long that the possibility that 

the signal overlaps either end of it can be disregarded. We then expand the 

signal into temporal modes 

- 1/2 

y^( t) = T exp ico^t. = 2irk/T, (68) 

whose amplitudes are 

T/2 

8(0, t)Y k *(t) dt »• 

-T/2 


V?) -/ 


- 1/2 

6T exp (-iu^x) F(u^-n), 


(69) 


where 


-f 


F(w) = / f(s) e 1WS ds 


(70) 


is the spectrum of the signal. The density operator p (0) now has a Gaussian 
P— representation, and as in (38) of reference [3] , 


a P /30_. n -1 

k 


9vi k * 8y k + * 

wr (a k ■ M k> p + p a?: ( \ - y k > 


(71) 
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+ 


where N is the mean number of background photons per mode and a^ and a^ are 

the annihilation and creation operators for the k-th mode. The right logarithmic 
[26J 


derivative is now 
L. = 

k 




^ ar <a k + - O + w 1 w Ca k - V 


(72) 


from which, with Tr pa? + a^ = N, aJ = a, - p, , we find for the matrix A of (35) 

K |C ' 


\ = V y k 


the elements 


ij , I ae ± 38 

k d 


k , n -l 9p k 3y k 

+ (N+l) gg- 3Q- 

1 1 


N 


-1 


8 2 G 


86, . 96„ 

li 2j 


+ (N+l) 


-1 8 G 


80~ . 80 . 
2i 13 


01 = e = 0 


(73) 


where 


G <?V?2> -Yj V A ) P k‘ ( ?2> ' 


? 1 B 2 J F ( w_fi 1 ) F (w-ft 2 ) ex P _itJ ^ T i~' r 2 ) ] <W2ir 


(74) 


in the limit T -> 00 . This is conveniently written in terms of the ambiguity 
[27] 


function as 


G(e i ,§ 2 ) = 0 1 B 2 exp | - ~ i(^ 1 +Q 2 ) (T]|_-t 2 ) j A ( T 1 -T 2 

By using eq. (2.11) of ch.X of reference [27], we find the derivatives 
3 2 g /36i x S6 2x - 3 2 G/38 ly 38 2y - 1. 
3 2 G/36 lx 38 2y --3 2 G/38 ly 36 2x .-i, 

8 2 G/8t 1 8t 2 = ) 3 J 2 Am 2 , 

8 2 G/8t 1 8w 2 = - Je| 2 [ A(«ot) + j i], 

8 2 G i /8t 2 8w 1 = - f 3 1 2 |a(cot ) - i ] , 

8 2 G/8w 1 8w 2 = / 3/ 2 Ax 2 , 


(75) 


( 76 ) 
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where we have set 6^ = 0^ = 0 and used the definitions of the mean-square 
2 2 

duration At , mean-square bandwidth. Aw , and mean duration-frequency product 
A(wt) as given in reference [27J, ch.I. §5, and we have taken 

t = 0, w + ft = 0, (77) 

where t is the mean epoch and w the mean frequency deviation, of. the signal. 
The matrix A now has the block form 


A = 


A 
~ 1 

0 



(78) 


with A^ given by (53); the elements of A^ are 

A 2tt = (2N+l)j 8| 2 Aw 2 /N(N+l), (79) 
A 2rfi = A 2ftr = ~l 2 [ (2N+l)A(wt) + |i ] /N(N+1) , (80) 
A 20fl = ( 2N+ D|e| 2 At 2 /N(N+1). (81) 


Thus the bounds on the error variances as given by the Cramer-Rao inequality 
(39) based on the r.l.d.'s L. depend on the matrix A 2 \ whose elements are 


/A 2 _1 j = & (2N+1) At 2 , (82) 

TT 

Tft 

(a -1 ) = & (2N+1) Aw 2 , (84) 

'an 


where 


& = I 8) 2 N(N+1) 


2 2 

D = Aw At - 


(2N+1) ‘ 

2 


D -rl 


A(wt) 


(85) 


In particular, (82) and (84) give the lower bounds on Var t and Var ft respecti- 
vely. 
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From the symmetrized logarithmic derivatives as given by (39) of 
reference [3] we obtain, as in (43) therein, the matrix elements of A' in (64), 


A!. = 4C2N+1) 1 Re 3 2 G/36 1 .36- . ] 
ij 2 J I 


( 86 ) 




where Re indicates the real part. From (76) this yields the matrix A 1 in block 
form 


A 1 9 


~2 


(87) 


where A^ is diagonal with diagonal elements -|-(2N+1) , and the elements of 


A 2 3re 


A 2tt ’ 4(2N+1) 1 ( 6| 2 A«i 2 , 

A 2,fi - a L- - lei 2 

A 2fla - 4 « N+1 >- 1 lei 2 ^ 2 - 

* -1 

The elements of the matrix A^ needed in (63) are now 

(4 _1 ) = i(2N + D|8j- 2 At 2 / Dj 

TT 

(~2 _1 ) = >- Dier 2 Mwt)/ Dj 

(~2 ,_1 ) = ^(2N+l)|e| 2 6 w 2 /D, 

£2 £2 


( 88 ) 

(89) 

(90) 

(91) 

(92) 

(93) 


with D given in (85). In particular, (91) and (93) give lower bounds on the 

A A 

variances Var t and Var £2 of estimates of arrival time t and carrier-frequency 
shift £2. These bounds are larger than those given by (82) and (84), except 
when B ' = 1/4. The minimum value 1/4 is attained by d when the signal has 
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a Gaussian envelope [27, p.20]; the s.l.d.'s and the r.l.d.'s then yield equal 
lower bounds. 

If, on the other hand, we invent a risk function of the form 


R = 


Var t 


Var ft 


(94) 


At Am 

find, upon using (39), (79)-(81), and y + = | Am iAtj, that when the operators 
estimating the arrival time t and the frequency shift ft commute. 


we 


R > 


2N(N+1) 

|8| 2 AmAt (2N+1) AmAt - 



(95) 


here we assumed for simplicity that A(mt) = 0. The s.l.d. bounds in (91) and 
(93) yield, on the other hand, 


R L \ (2N+1)|B| 2 Am~ 2 At~ 2 =R 2 (96) 

and R^ >_ R 2 for -|-j< Am At N + Thus if the arrival time and carrier 
frequency of the signal are to be measured in the same receiver, ther.l.d. 
Cramer-Rao inequality (39) sometimes yields a superior lower bound on a weighted 
sum of the error variances to that provided by the s.l.d. inequality (63). 


Efficient estimators for essential parameters such as arrival time and 
carrier-frequency do not exist, and the lower bounds derived here and in [3] 
have only an asymptotic significance. Both sets go into the classical forms 
in the limit N>>1. 

For simultaneous estimation of the complex amplitude B = B^ + iB the 

/S A 

r.l.d. bound on Var B^ + Var B is, as we have seen, greater than the s.l.d. 
bound, and from this we gain an instructive insight into the role of noncommuting 
observables. A similar result was found for the special risk function in (94). 
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For single measurements of other parameters than the amplitude, however, the 
s.l.d. bound may be superior. 

Conclusion 

In order to realize minimum Bayes cost in quantum detection and estimation 

by measurements of commuting operators it may be necessary to couple the receiver 

S with an auxiliary apparatus A and perform the measurements on the combination. 

The resulting Bayes cost is the same as if commuting operators were somehow 

measured on the pair S + A before they interacted, and the optimum strategy is 

equivalent to a resolution of the identity into possibly noncommuting, positive- 

definite Hermitian operators acting in the Hilbert space of the receiver 

alone. The minimization of the Bayes cost can, therefore, be carried out over 

the class of such noncommuting operators in which having been found can be 

S 

extended to a resolution of the identity into commuting projection operators 
in an encompassing Hilbert space characterizing a combination of the 

receiver S with some auxiliary apparatus A initially ignorant of the state of 
the receiver. Minimum Bayes cost can be attained without violating the quantum- 
mechanical restriction of simultaneous measurement to the class of commuting 
Hermitian operators. 

There are two types of Cramer-Rao inequalities setting lower bounds to 
mean-square errors and quadratic loss functions in unbiased estimates of 
parameters of the density operator of a quantum-mechanical receiver. Sometimes 
one, sometimes the other yields the superior bound. The difference between 
them lies not in how they handle the commutativity of the estimating operators, 

but in their definitions in terms of right-hand or symmetrized logarithmic der- 
ivatives. Both can be applied to a combination of the receiver S with an ar- 
bitrary ignorant auxiliary apparatus A, on which combination S + A commuting 

observables are measured. The lower bounds they assert depend only on the 
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parameters of the receiver S. Because commuting estimators on S + A can be 
reduced to possibly noncommuting estimators on the receiver S alone, both 
inequalities apply to these as well. 
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